CDS

Accession Number TCMCG064C30838
gbkey CDS
Protein Id XP_011097859.1
Location join(1122931..1123159,1123297..1123379,1123543..1123637,1123959..1124062,1124441..1124610,1124718..1124832,1124995..1125050,1125134..1125172,1125607..1125705,1126299..1126361,1126566..1126628,1126853..1126935,1127103..1127211,1127292..1127345,1127477..1127564,1127793..1127830,1128260..1128365,1128688..1128854,1129160..1129219)
Gene LOC105176677
GeneID 105176677
Organism Sesamum indicum

Protein

Length 606aa
Molecule type protein
Topology linear
Data_file_division PLN
dblink BioProject:PRJNA268358
db_source XM_011099557.2
Definition imidazole glycerol phosphate synthase hisHF, chloroplastic [Sesamum indicum]

EGGNOG-MAPPER Annotation

COG_category E
Description Belongs to the HisA HisF family
KEGG_TC -
KEGG_Module M00026        [VIEW IN KEGG]
KEGG_Reaction R04558        [VIEW IN KEGG]
KEGG_rclass RC00010        [VIEW IN KEGG]
RC01190        [VIEW IN KEGG]
RC01943        [VIEW IN KEGG]
BRITE ko00000        [VIEW IN KEGG]
ko00001        [VIEW IN KEGG]
ko00002        [VIEW IN KEGG]
ko01000        [VIEW IN KEGG]
KEGG_ko ko:K01663        [VIEW IN KEGG]
EC -
KEGG_Pathway ko00340        [VIEW IN KEGG]
ko01100        [VIEW IN KEGG]
ko01110        [VIEW IN KEGG]
ko01230        [VIEW IN KEGG]
map00340        [VIEW IN KEGG]
map01100        [VIEW IN KEGG]
map01110        [VIEW IN KEGG]
map01230        [VIEW IN KEGG]
GOs -

Sequence

CDS:  
ATGACCGCTGTCGAGACGAGCGTAGCACGAGCTGCGGCAAACCAAATGGAGGTGGCGGCGGGAATCACTTCTGCTCCGCAGTTCCCCAAGGCATCCTCTTATTCCTCCTTCTCTTCAGCTTACTCCGGCGGTTCTCATTCTCTACAGTCGCTTCAGTTCAACCCTCTCAAATTCAAACACGCGAGAACTCTCGCAATTCGCGCCTCTGTTTCTGCTGCCGATGACTCCGTGGTGACATTGCTTGATTACGGAGCTGGAAATGTGCGGAGTCTCAGAAATGCAATTCGCTTTCTTGGCTTTAATATAAAAGATGTGCAAAAGCCGGAGGACATTTTGAATGCCAAACGTCTTATTTTTCCAGGTGTAGGTGCATTTGCGCCTGCCATGGATGTGCTAAACAAGACAGGAATGGGAGAAGCACTCTGTTCCTACATTGAGCAAGATCGTCCATTTCTAGGCATTTGTCTTGGATTGCAACTACTGTTTGAGTCAAGTGAGGAAAATGGACCAGTGAAAGGGCTTGGTTTGATTCCTGGGGTAGTTGGTCGTTTTGACTCATCAAATGGTGTCAGGGTACCTCATATTGGTTGGAATGCTATCCACATAACAAGAGAATCTCAGATTTTGGATGATATTGGGAGCCGTCATGTCTATTTTGTTCATTCATACCGCGCAATGCCATCTGATGATAACACAGAATGGGTATCTTCTACGTGCAACTATGGTGACAACTTCATAGCTTCTATAAGAAGGGGAAATGTCCATGCAGTGCAATTTCACCCAGAGAAGAGTGGAGATGTTGGCCTCTCTGTTCTTCGAAGGTTTCTGAATCCCAAGTCTGAAATGACAAAGAAGCCAGCCCAAGGGAAGGCGTCTAAACTTGCAAAGAGGGTCATTGCTTGTCTTGATGTGAGAACAAATGACAAAGGTGATCTTGTTGTAACTAAGGGAGACCAATATGATGTGAGAGAGCACACCAAAGAAAATGAGGTGAGAAACCTCGGCAAGCCAGTGGATCTTGCTGGACAATACTACAAGGATGGGGCTGATGAGGTTAGTTTTCTGAATATTACTGGCTTCCGCGACTTTCCTCTTGGCGATTTACCCATGTTGCAGGTATTAAGGCACGCATCGGAGAATGTTTTTGTCCCATTAACAGTTGGAGGTGGCATTCGAGATTTTACTGATGCAAATGGCAGGTACTACTCGAGTTTGGAGGTTGCTGCAGAGTACTTCCGGTCAGGTGCCGATAAGATTTCTATTGGAAGTGATGCAGTTTATGCTGCAGAAGAATACCTAAAAACAAAAGTAAAATCTGGAAAGAGCAGCCTAGAGCAGATCTCTAGAGTGTATGGAAATCAAGCTGTGGTTGTAAGCATTGATCCTCGTAGAGTGTACTTGAAAGATCCCAAGGATGTAGAGTTCAAGTCCACAAGAGTAACAAACCCAGGTCCAAATGGGGAGCAATATGCTTGGTACCAATGCACGGTGAATGGTGGACGAGAAGGTCGACCAATTGGAGCATATGAGCTTGCAAAAGCTGTTGAAGAACTGGGAGCTGGAGAAATACTCCTAAACTGCATCGACTGTGATGGTCAAGGAAAAGGATATGATATAGATCTGATAAAGCTTATCTCAGATGCTGTAAGTATTCCTGTAATAGCAAGTAGTGGTGCTGGAGCGGTTGAACACTTCTCAGAAGTTTTCTCCAAAACAAATGCATCTGCTGCCCTTGCTGCTGGTATTTTCCACCGGAAGGAGGTGCCTATACAATCTGTGAAAGAGCACTTGCTGCAGGAAGGAATAGAAGTTAGAATGTGA
Protein:  
MTAVETSVARAAANQMEVAAGITSAPQFPKASSYSSFSSAYSGGSHSLQSLQFNPLKFKHARTLAIRASVSAADDSVVTLLDYGAGNVRSLRNAIRFLGFNIKDVQKPEDILNAKRLIFPGVGAFAPAMDVLNKTGMGEALCSYIEQDRPFLGICLGLQLLFESSEENGPVKGLGLIPGVVGRFDSSNGVRVPHIGWNAIHITRESQILDDIGSRHVYFVHSYRAMPSDDNTEWVSSTCNYGDNFIASIRRGNVHAVQFHPEKSGDVGLSVLRRFLNPKSEMTKKPAQGKASKLAKRVIACLDVRTNDKGDLVVTKGDQYDVREHTKENEVRNLGKPVDLAGQYYKDGADEVSFLNITGFRDFPLGDLPMLQVLRHASENVFVPLTVGGGIRDFTDANGRYYSSLEVAAEYFRSGADKISIGSDAVYAAEEYLKTKVKSGKSSLEQISRVYGNQAVVVSIDPRRVYLKDPKDVEFKSTRVTNPGPNGEQYAWYQCTVNGGREGRPIGAYELAKAVEELGAGEILLNCIDCDGQGKGYDIDLIKLISDAVSIPVIASSGAGAVEHFSEVFSKTNASAALAAGIFHRKEVPIQSVKEHLLQEGIEVRM